Logarithmic Equal-Letter Runs for BWT of Purely Morphic Words
Abstract
In this paper we study the number $$r_{\texttt {bwt}}$$ of equal-letter runs produced by the Burrows-Wheeler transform (BWT) when it is applied to purely morphic finite words, which are words generated by iterating prolongable morphisms. The parameter $$r_{\texttt {bwt}}$$ is very significant, since it provides a measure of the performance of the BWT in terms of both compressibility and indexing. In particular, we prove that, when the BWT is applied to any purely morphic finite word on a binary alphabet, $$r_{\texttt {bwt}}$$ is $$\mathcal {O}(\log n)$$, where n is the length of the word. Moreover, we prove that $$r_{\texttt {bwt}}$$ is $$\varTheta (\log n)$$ for the binary words generated by a large class of prolongable binary morphisms. These bounds are proved by providing some new structural properties of the bispecial circular factors of such words.
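The quantity studied in the abstract can be explored empirically with a minimal sketch such as the following. It is not the paper's method: it assumes the BWT is taken as the last column of the sorted cyclic rotations, with no end-of-string sentinel, and it uses a naive quadratic construction. The function names and the two example morphisms (Thue-Morse and Fibonacci, both prolongable on `a`) are illustrative choices, not taken from the source.

```python
from itertools import groupby


def iterate_morphism(morphism, letter, k):
    """Apply the morphism k times, starting from a single letter."""
    word = letter
    for _ in range(k):
        word = "".join(morphism[c] for c in word)
    return word


def bwt(word):
    """Naive BWT: last column of the sorted cyclic rotations (no sentinel)."""
    rotations = sorted(word[i:] + word[:i] for i in range(len(word)))
    return "".join(rotation[-1] for rotation in rotations)


def count_runs(s):
    """Number of maximal equal-letter runs in s."""
    return sum(1 for _ in groupby(s))


# Illustrative binary morphisms, both prolongable on "a" (assumed examples).
THUE_MORSE = {"a": "ab", "b": "ba"}
FIBONACCI = {"a": "ab", "b": "a"}

if __name__ == "__main__":
    for name, morphism in (("Thue-Morse", THUE_MORSE), ("Fibonacci", FIBONACCI)):
        for k in range(1, 11):
            word = iterate_morphism(morphism, "a", k)
            print(name, k, len(word), count_runs(bwt(word)))
```

Running the script prints, for each iterate, the word length n next to the number of runs, which makes it easy to compare the growth of the run count against log n on small cases.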
Similar resources
Extremal words in morphic subshifts
Given an infinite word x over an alphabet A, a letter b occurring in x, and a total order σ on A, we call the smallest word with respect to σ starting with b in the shift orbit closure of x an extremal word of x. In this paper we consider the extremal words of morphic words. If x = g(f^ω(a)) for some morphisms f and g, we give two simple conditions on f and g that guarantee that all extremal wo...
Logarithmic density in morphic sequences
We study the logarithmic frequency of letters and words in morphic sequences and show that this frequency must always exist, answering a question of Allouche and Shallit.
Optimal-Time Text Indexing in BWT-runs Bounded Space
Indexing highly repetitive texts, such as genomic databases, software repositories and versioned text collections, has become an important problem since the turn of the millennium. A relevant compressibility measure for repetitive texts is r, the number of runs in their Burrows-Wheeler Transform (BWT). One of the earliest indexes for repetitive collections, the Run-Length FM-index, used O(r) s...
Almost Rich Words as Morphic Images of Rich Words
We focus on Θ-rich and almost Θ-rich words over a finite alphabet A, where Θ is an involutive antimorphism over A∗. We show that any recurrent almost Θ-rich word u is an image of a recurrent Θ′-rich word under a suitable morphism, where Θ′ is also an involutive antimorphism. Moreover, if the word u is uniformly recurrent, we show that Θ′ can be set to the reversal mapping. We also treat one spe...
On the Periodicity of Morphic Words
Given a morphism h prolongable on a and an integer p, we present an algorithm that calculates which letters occur infinitely often in congruent positions modulo p in the infinite word h(a). As a corollary, we show that it is decidable whether a morphic word is ultimately p-periodic. Moreover, using our algorithm we can find the smallest similarity relation such that the morphic word is ultimate...
Journal
Journal title: Lecture Notes in Computer Science
Year: 2022
ISSN: 1611-3349, 0302-9743
DOI: https://doi.org/10.1007/978-3-031-05578-2_11